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1 . Introduction 

Reference  15]  formulated  a queueing  model  to  address  the  problem 
of  determining  the  optimum  number  of  spares  and  repair  channels  for  a pop- 
ulation of  stochastic  failing  units.  The  model  assumed  that  a requirement 
for  a high  availability  of  spares  was  Imposed  and  approximated  the  multi- 
stage service  system  with  a series  queue.  Under  the  same  assumptions  as 
in  Reference  (5],  i.e. , exponential  failure  and  service  times,  this  paper 
formulates  the  problem  as  a cyclic  queue  for  which  an  exact  solution  is 
tractable.  This  exact  model  can  be  considered  as  an  extension  of  the 
classic  machine  repair  problem  with  spares.  Reference  [1], 

Section  2 of  this  paper  deals  with  definitions  and  notation.  The 
classic  machine  repair  problem  and  its  logical  extension  to  many  repair 
stages  is  first  discussed.  Then  a cyclic  queue  is  defined.  Next  the  ex- 
tended machine  repair  problem  is  framed  as  a cyclic  queueing  system,  for 
which  the  literature  has  applicable  results.  The  section  concludes  with 
the  definition  of  availability. 

Section  3 reviews  and  categorizes  three  key  results  from  the  lit- 
erature in  the  field  of  networks  and  cyclic  queues.  Section  A formulates 
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both  the  approximate  model  of  [5]  and  the  exact  model  of  this  paper. 
Section  5 compares  the  accuracy  and  computational  characteristics  of  the 
two  models  for  gas  turbine  engine  data  from  [5].  Section  6 presents  the 
conclusions. 

2.  Definitions  and  Notation 


2.1  Classic  Machine  Repair  Problem 

The  classic  machine  repair  problem  with  spares  consists  of  a fixed 
number  of  identical  machines  of  which  initially  M are  operating  and  y 
are  spares,  i.e.,  the  fixed  total  population  is  M+y  . By  identical  is 
meant  the  machines  have  the  same  distributions  for  failure  and  service 
times,  and  that  there  are  no  priorities  or  queue  disciplines  other  than 
first  come,  first  served. 

The  M machines  are  in  parallel  and  are  independent.  When  one 
fails  in  service,  it  is  instantaneously  replaced  by  a spare,  if  one  is 
available.  If  not,  less  than  M machines  will  operate  until  a repaired 
machine  becomes  available.  Simultaneously,  the  failed  machine  goes  in- 
stantaneously into  a repair  facility  from  which,  once  repaired,  it  goes 
instantaneously  into  the  spare  parts  pool  (or  directly  into  service,  if 
less  than  M machines  are  operating).  This  process  is  shown  in  Figure  1. 

The  following  assumptions  are  now  made: 

(a)  the  system  failure  rate  is  proportional  to  the  number  of 
operating  machines, 

(b)  each  machine  has  exponential  failure  times  with  mean  1/X  , 

(c)  there  are  c parallel  servers  (repair  channels)  in  the 
repair  facility, 

(d)  each  server  has  exponential  service  times  with  mean  l/p  . 

Letting  n equal  the  number  of  "down"  machines  in  the  repair  facility, 
the  problem  becomes  a Markovian  birth-death  process  with  parameters 


2 


I 
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0 < n < y 

y < n < M+y  , 
n > H+y 

0 < n < c 
n > c 

The  solution  for  , the  probability  that  there  are  n machines 

in  the  repair  phase,  can  be  found  on  page  123  of  [41. 

2.2  Extension  of  the  Machine  Repair 
Problem  to  Many  Stages 

A logical  extension  to  the  classic  machine  repair  problem  with 
spares  is  to  introduce  more  than  one  step  into  the  repair  phase.  These 
additional  steps,  or  stages,  could  represent  the  removal  of  a failed 
machine,  the  transportation  to  the  repair  facility,  the  repair  itself, 
transportation  from  the  repair  facility,  etc.,  until  it  is  returned  to  the 
spare  parts  pool.  Such  a model  is  shown  in  Figure  2,  where  each  additional 
stage  consists  of  a number  of  parallel  servers  with  exponential  service  times. 

The  straightforward  application  of  the  Markovian  birth-death  process 
is  no  longer  directly  applicable  as  before.  Reference  [5]  solves  this 
extended  model  by  Imposing  a high  availability  constraint  on  the  spare 
parts,  then  making  a simplifying  assumption  that  the  operating  stage  acts 
as  an  Infinite  source  (true  Poisson)  input  process  to  a series  (multi-stage 
repair)  queue.  The  model  of  this  paper  solves  the  extended  model  by  treating 
it  as  a cyclic  queue,  without  any  requirement  for  high  availability. 

2.3  Cyclic  Queueing  System 

Figure  3 represents  a cyclic  queueing  system.  There  is  a total  of 
N identical  customers  in  the  K separate  stages  of  the  system.  Each  stage 
consists  of  Cj  parallel  servers,  each  with  exponentially  distributed  service 

times  with  mean  p^  , i»l,2,...,K  . The  system  is  closed,  i.e.,  no 


and 


MA 

(M-n+y ) A 
0 


np  , 

cp  , 
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customers  enter  or  leave  the  system.  Upon  being  served,  a customer  goes 
directly  to  the  next  stage.  If  a server  is  free,  the  customer  goes  directly 
into  service.  If  not,  the  customer  forms  or  joins  a queue,  which  can  never 
be  blocked  (i.e.,  infinite  waiting  room  between  stages).  Upon  completing 
the  Kth  stage,  the  customer  goes  directly  to  the  first  stage  and  starts 
the  cycle  over. 


Letting  n^  , c^  , and  p^  represent  the  number  of  customers, 

parallel  servers,  and  exponential  service  mean  rate,  respectively,  at  the 
ith  stage,  i=l,2,...,K  , we  have  the  following  relationships: 


service  rate  at  ith  stage 


Vi  • 

n . < 
i — 

ci 

Vi  * 

ni > 

ci 

and 


total  number  of  customers  in  the  K stages  of  the  system  = N = £ n.  . 

i = l 1 


2.4  Machine  Repair  Problem  as  a Cyclic  Queue 

The  extended  machine  repair  problem  with  spares  will  now  be  framed 
as  a cyclic  queue.  Of  the  K stages  of  the  cyclic  queue,  the  first  stage 
represents  the  operating  machines.  The  c^  parallel  servers  at  the  first 

stage  can  be  considered  to  be  the  M operating  machines  of  the  classic 
problem  (c^  * M)  . Machines  that  fail  in  Stage  1 go  directly  to  Stage  2, 

i 

a removal  phase.  There  are  c^  parallel  servers  there  which  represent  the 

number  of  "machine  remqvers"  present.  After  removal,  machines  go  to  Stage 
3,  say  a transportation  phase,  then  to  Stage  4,  etc.  At  the  ith  stage 
(i*l,2 , . . . ,K)  , there  are  c ^ parallel  servers,  each  with  an  exponential 

service  rate  p^  . Note  that  c}  can  be  set  to  the  total  population  size, 

N , in  the  system  (i.e.,  effectively  set  to  infinity)  to  represent  an  ample 
server  stage,  with  no  possibility  of  a queue  forming  in  front  of  that  stage 
This  might  be  appropriate  for  removal  and  transportation  phases. 
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After  leaving  the  Kth  stage,  the  machine  returns  tn  Stage  1 ready 
for  service.  If  less  than  c (i.e.,  M)  machines  are  operating,  the  newly 

repaired  machine  goes  right  into  service.  If  all  c^  servers  are  busy 

(i.e.,  all  M are  "up")  the  newly  repaired  machine  either  starts  a queue  or 
Joins  an  existing  one  in  front  of  Stage  1.  This  queue  represents  the  spare 
machines  on  hand  (i.e.,  the  inventory). 

Table  1 compares  the  terminology  between  the  machine  repair  problem 
and  the  cyclic  queueing  system. 


TABLE  1 

COMPARATIVE  TERMINOLOOY 


NOTATION 

MACHINE  REPAIR  PROBLEM 

CYCLIC  QUEUE 

Number  of  Operating 
Machines 

M 

C1 

Machine  Failure  Rate 

X 

^1 

Total  Number  of  Machines 

M+y 

N 

Number  of  Spares 

y 

N"  C! 

Size  of  Inventory 
(Spare  Parts  Pool) 

M+y  - Number 
in  Repair  Phase 

Queue  Size 
at  Stage  1 

2.5  Definition  of  Availability 

Suppose  there  are  three  spare  machines  in  the  inventory,  i.e.,  a 
queue  of  size  three  in  front  of  Stage  1.  If  an  operating  machine  fails,  a 
spare  is  instantaneously  pulled  from  the  inventory  and  put  into  service. 
Therefore,  at  the  time  of  a failure,  a spare  is  available.  Now  assume 
that  the  queue  has  shrunk  to  one  spare.  Again,  if  an  operating  machine 
fails,  a spare  is  available.  At  this  point,  the  queue  size  is  zero  and 
no  spares  are  available,  but  the  operating  system  is  unconcerned;  the 
operating  system  looks  to  the  inventory  only  when  an  operating  machine 
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fails,  and  at  no  other  time.  By  the  time  another  machine  falls,  a newly 
repaired  machine  might  have  arrived  from  the  Kth  phase,  resulting  in  a 
positive  queue  again.  In  other  words,  even  though  temporarily  no  spares 
were  available,  the  operating  system  does  not  know  or  care  about  it 
unless  a failure  occurs.  As  long  as  M machines  are  up  without  interrup- 
tion, the  availability  of  spares  is  not  a factor  in  the  operation.  There- 
fore, the  definition  of  availability  involves  conditioning  on  the  occurrence 
of  a failure.  More  precisely, 

AVAILABILITY  = probability  that  the  spares  inventory 
is  not  empty  given  that  a failure  is 
about  to  take  place 

= probability  that  the  queue  size  at 
Stage  1 is  greater  than  0 given  that 
a failure  is  about  to  take  place. 


An  algebraic  expression  for  these  "failure  point  probabilities"  in 
terms  of  the  "general  time  probabilities"  can  be  derived  from  Bayes'  theo- 
rem. First,  define^ 


P(n^)  = general  time  probability  that  there 
are  n^  customers  at  Stage  1 

and 

Q(n^)  * conditional  (or  failure  point)  prob- 
ability that  there  are  n^  customers 

at  Stage  1 given  that  a failure  is 
about  to  occur. 

By  Bayes'  theorem: 


QO^)  - 

Pr {failure  about  to  occur  at  Stage  1 1 n ^ customers  at  Stage 
N 

£ Prtfallure  about  to  occur |n  } • P(n  ) 

— .n  1 


P(nx) 


(1) 


All  probabilities  are  assumed  to  be  steady-state. 
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where  N Is  the  maximum  number  of  customers  in  the  system.  We  will 
return  to  Equation  (1)  in  the  development  of  the  specific  models. 

With  Q(n^)  so  defined,  we  can  now  define  availability  as: 

N 

AVAILABILITY  = £ Q(n  ) . 

Vcl+1 

It  remains  now  to  find  the  P(nj)  using  the  theory  of  cyclic  queues. 

3.  Literature  Review  on  Cyclic  Queues  and  Networks 

In  the  course  of  research  for  this  paper  more  than  40  references 
on  the  subject  of  cyclic  queues  and  networks  were  reviewed  and  categorized. 
Networks  were  included  because  cyclic  queues  can  be  considered  as  a subset 
of  networks.  Only  the  three  most  relevant  references  to  this  paper's 
application  are  discussed  here.  The  others,  dealing  with  variations  such 
as  travel  time  between  stages  (see  [9]),  are  not  included. 

Table  2 shows  the  key  features  of  the  models  presented  in  the  three 
pertinent  references.  All  of  the  models  are  multi-stage,  all  stages  have 
only  exponential  service,  there  is  no  travel  time  between  stages,  and  all 
customers  are  identical  (i.e.,  single  class  of  customer).  The  no  travel 
time  restriction  is  not  crucial  since  travel  time  between  stages  can  be 
handled  by  simply  introducing  an  ample  server  transportation  stage  between 
any  two  stages  in  the  cyclic  queue. 

3.1  J.  R.  Jackson,  Reference  [6] 

In  his  1957  paper,  Jackson  proved  a theorem  for  networks  in  a 
steady-state  condition.  The  theorem  yields  the  steady-state  joint  probability 
for  the  number  of  customers  at  each  stage.  The  network  has  multiple  stages. 

Each  stage  has  parallel  channels,  with  all  servers  at  a given  stage  having 
the  same  exponential  service  time  distribution.  Each  stage  can  also  have 
external  Poisson  input  to  it  and  can  output  from  the  system.  Customers 
could  go  from  one  stage  to  any  other  stage  in  the  network  (feedback/ feedforward) 
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according  to  some  known  probability  distribution.  In  1963  [7],  Jackson 
extended  his  theorem  to  allow  state-dependent  service  at  any  stage. 

Before  describing  the  Jackson  results,  we  will  first  consider  a 

single  M/M/c  queue,  with  input  Poisson  stream  parameter  A and  exponential 

service  parameter  p . From  the  well-known  results  for  p , the  steady- 

n 

state  probability  that  there  are  n customers  in  the  system  (service  plus 
queue),  given  the  parameters  A , p , and  c , are: 


P (A.p.c) 
n 


( G)'  h • 

G)"  p° ! " 


n < c 


(3) 


> c , 


(for  — < 1)  . 
cp 


I 


Jackson  considers  a network  such  as  the  one  in  Figure  4,  where  a 
customer's  path  through  the  network  is  influenced  by  p..  , the  probability 

that  a customer  leaving  the  jth  stage  goes  to  stage  i (J»1,2,...,K  , 

K 

i*0,l,...,K)  ; i-0  represents  leaving  the  system  and  £ p - 1 for 

t-0  11 

all  j . He  defines  rx.  as  the  Poisson  parameter  of  the  external  input  to 
stage  i,  and  to  be  the  total  mean  input  rate  to  stage  i.  Therefore, 


t 

I 


i • °i + J,  Vji 


(4) 


F.sser.tially , Jackson's  theorem  states  that  the  steady-state  joint  probability 
of  customers  at  Stage  1,  n^  at  Stage  2,  ...»  n at  Stage  K,  can  be 

determined  by  first  assuming  that  each  stage  is  an  independent  M/M/c  queue 
with  input  parameter  and  service  parameter  rate  p , then  using 

Equation  (3)  at  each  stage,  and  then  multiplying  the  results  to  obtain  the 
Joint  probability  distribution.  This  can  he  written  as  (for  T < p^c^  , 

for  al 1 1*1, . . . ,K) 
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_ (r  .Vi  ,c  , i«l,...,K) 
■nk  1 1 1 


Pn1(I'l’lJl*Cl)Pn2(r2*lJ2*C2)---pnK(rK-UK’CK)  ’ 


(5) 


where  p (r.,u.,c.)  Is  determined  from  Equation  (3)  with  the  parameters 
n^  i i i 

substituted  accordingly. 


Note  that  Jackson's  theorem  does  not  assert  independence  of  the 
stages,  just  that  if  independence  is  assumed,  the  resulting  joint  probability 
is  correct.  In  his  proof,  he  first  sets  up  the  steady-state  difference 
equations,  then  postulates  the  solution,  and  finally  shows  that  the  solu- 
tion satisfies  the  equations. 


3.2  E.  Koenigsberg,  Reference  [8] 

In  his  1958  paper,  Koenigsberg  looked  at  a cyclic  queue  with  multiple 
stages,  but  with  only  a single  server  at  each  stage.  The  service  times  were 
exponential.  Since  it  was  a cyclic  queue  and  not  a network,  there  was  no 
exogenous  input  or  output  and  no  feedback  or  feedforward,  except  directly 
to  the  next  stage. 

» 

The  method  of  Koenigsberg  was  to  set  up  the  differential  difference 

(N+K”  1\ 

) equations 

in  the  same  number  of  unknowns  (which  is  the  number  of  ways  to  put  N in- 
distinguishable items  into  K boxes,  any  number  to  a box),  was  intractable 
for  a general  solution.  Koenigsberg  then  postulated  the  solution,  which 
satisfied  the  set  of  simultaneous  state  equations  when  substituted  back  in. 
Were  it  not  for  the  limitation  to  a single  server  at  each  state,  we  could 
have  used  Koenigsberg’ s results  directly  for  the  development  of  an  exact 
model  for  the  extended  machine  repair  problem. 

3.3  R.  Swersey,  Reference  [10] 

Swersey  noted  in  his  1967  paper  that  cyclic  queues  are  subsets  of 
networks  and  he  applied  Jackson's  network  results  of  [6]  and  [7]  to  a 
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Koenigsberg  type  of  cyclic  queue.  This  amounts  to  setting  to  zero 

for  all  1 and  p.j  to  one  for  j“i-l  and  to  zero  otherwise  in  Equation 
(4).  Therefore,  T = I* for  all  i , i“l,2,...,K  (with  the  understand- 


ing that  rQ  * ).  Thus,  f is  a constant  (to  be  determined)  throughout 


the  system.  This  means  that  in  steady  state,  the  input  rate  to  any  stage 
is  a constant,  and  if  the  cyclic  queue  is  cut  at  any  point  between  any  two 
stages,  the  same  constant  flow  rate  of  customers,  T , would  be  observed. 

Under  the  restrictions  that  the  sum  of  customers  is  constant  (N) 
and  that  the  joint  probability  must  integrate  to  unity,  the  Jackson  theorem, 
as  applied  by  Swersey,  can  be  used  to  solve  for  steady-state  probabilities 
for  cyclic  queues. 


4.  Formulation  of  the  Exact  and  Approximate  Models 
4.1  The  Exact  Model 

By  the  Jackson  theorem.  Equation  (5)  gives  the  steady-state  joint 
probability  where  Equation  (3)  yields  the  Individual  factors  in  the  ex- 
pression. 


Applying  the  Swersey  analysis  (i.e.,  T a constant)  yields 


where  p is  the  steady-state  joint  probability  that  there  are  n 

Hj  » • • • » 1 

customers  at  Stage  i,  for  i going  from  1 to  K.  For  i«l,2,...,K  : 


n, ! ; 

< 

c. 

i * 

, ni~Ci 
ci!  Ci  ; 

i 

i 

ni 

> 
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K 

( n.  * number  of  customers  at  Stage  i ( £ n = N ) 

i-1  1 

Cj  **  number  of  parallel  servers  at  Stage  1 

lij  ” mean  service  rate  of  each  server  at  Stage  1 

pQ  ■ steady-state  probability  that  there  are  zero  customers 
i at  the  ith  stage  when  the  ith  stage  is  treated  as  an 
independent  M/M/c^  queue  with  Poisson  input  T and 

service  rate  . 


K 

S - {n  , i-l,2,...,K  : £ n « N}  . (8) 

i«l 

In  the  above  "constant  of  integration,"  , the  summation  is  taken 

over  all  possible  partitions  of  the  N customers  into  the  K stages. 

Note  that  the  joint  probability  in  Equation  (6)  no  longer  involves  the  un- 
known steady-state  system  flow  T , since  it  has  been  incorporated  into 
the  constant  of  integration. 

To  determine  the  marginal  probability  for  the  number  of  customers 
at  Stage  1,  n^  , we  must  sum  the  joint  probability  over  all  partitions  of 

N-n^  into  K-l  stages,  i.e.,  over  the  set  , where 
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K 

S2  “ (n  j , i“2, 3, . . . ,K  : ^ n(  ■=  N-n^l  . (9) 

1=2 

Letting  P^n^)  represent  the  steady-state  "exact"  (since  we  will 

be  comparing  th i exact  model  to  an  approximate  model  mentioned  later) 
marginal  probability  that  there  are  customers  at  Stage  1, 


w 


(10) 


To  determine  the  availability  of  spares,  we  must  first  determine 

the  failure  point  probability.  Equation  (1).  (Note  that  P (n  ) , Equa- 

E X 

tion  (10),  is  a general  time  probability.)  Since  we  have  an  exponential 
service  discipline, 

Prffailure  about  to  occur  at  Stage  1 |n^  customers  at  Stage  1} 

nl*Jl^t  + 5 £. 

c^p^At  + o(At)  ; n^  > c 


Equation  (1)  can  now  be  written  (with  subscript  E added  to  denote  this 

exact  model)  after  dividing  numerator  and  denominator  by  At  and  taking 

lim  , as 
At-+0 


W 


A3nlUl 

W i 

n^  < c 

A3Vl 

W : 

n^  > c 

where 


(11) 


s1  = L Vl  W 


n^O 


(12) 


+ £ ..  Vl  W 


W1 
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Equation  (2),  the  expression  for  the  availability,  now  becomes 
(again  with  the  subscript  E added) 


AVAIL 


E 


N 

1 W ' Vl“l 

nrci+1 


N 

I 


nrci+1 


W 


(13) 


Equation  (13)  is  the  availability  of  the  system.  It  requires  Equa- 
tions (12),  (10),  and  (7)  to  compute,  but  the  availability  can  be  expressed 
totally  in  terms  of  the  system  parameters  p^.c^  (1=1,..., K)  , N , and  K . 


Note  that  the  constant  A^  , in  Equation  (12),  requires  the  evalua- 
tion of  Pg(n^)  for  aH  nj  from  0 to  N,  the  maximum  in  the  system.  This 

is  a key  point  in  discussing  the  computational  efficiency  of  the  exact  and 
approximate  models. 

4.2  The  Approximate  Model 

Consider  an  M/M/c  queue  with  input  parameter  A and  service  rate 
p . The  steady-state  probability  that  there  are  n in  the  system  is  given 
by  Equation  (3).  Burke,  in  his  1956  paper  [2],  proved  that  the  output  from 
that  queue  was  Poisson  with  parameter  A . Thus,  the  output  is  independent 
of  the  service  rate  mean,  p , as  long  as  the  service  time  is  exponentially 
distributed.  Burke  also  reasoned  in  his  1972  paper  [3],  that  since  A in 
results  in  A out,  then  a series  of  M/M/c  queues  could  be  formed  with  A 
in  at  one  end  and  A out  at  the  other,  as  Indicated  in  Figure  5.  Each 
stage  is  independent  of  the  other  stages  and  the  steady-state  joint  probabil- 
ity could  be  found  by  multiplying  the  probability  from  Equation  (3)  for  each 
stage. 


The  crux  of  the  approximate  model  is  the  assumption  that  the  first 
stage,  the  operating  machine  stage,  is  almost  always  operating  at  full  ca- 
pacity, i.e.,  all  Cj  machines  are  up.  (This  is  the  implication  of  a high 

availability  constraint.)  Then  the  output  from  the  first  stage  is  a pure 
Poisson  process  with  parameter  CjP^  and  the  first  stage  acts  like  an 

Infinite  source  input  to  the  rest  of  the  system. 


- 18  - 


T-  346 


Stages  2 through  K can  now  be  treated  as  the  Burke  series  of  Figure 
5,  and  the  joint  probability  that  there  are  n^  in  Stage  2,  n^  in  Stage 

3,  . n in  Stage  K is 


. tof ! i. P m* ■ 

n2,n3 nK  \ U2  ) b2  °2  \ UK  / 


bK  P°K 


(14) 


where  for  i=2,...,K 


bi  " 


n , ! 


. ni“Ci 
ci  ci 


ni  < 


ni  > ci 


l°r  1=2, . . . ,K  and  - probability  there  are  zero  c 

tomers  at  the  ith  stage. 


us- 


After  summing  over  all  possible  ni  to  get  the  "constant  of  inte- 
gration," 


n2»ny  • • • • nR  2 


5 i. . 

1=2  \ Hi  / bi 


where 


.-1 


= n l (^)  L-  - J j \ _L 

i=2  n1=0  \ Pi  / bi  1=2  | ni=0  V l'i  / ni! 


1 /Vl\V  Cibl  \ 

:i!  \ Pi  / \CiWi"ClUl/ 


(15) 


(16) 


At  this  point  there  is  no  restriction  on  £ n because  with  an 

i=2  1 

infinite  source  at  Stage  1,  any  system  size  is  possible.  However,  in  the 

K 

real  environment,  there  is  a constraint,  namely  £ n ■ N-n  , where  N 

i=2  1 1 
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is  the  total  population  In  the  system  and  n(  Is  the  number  at  Stage  1,  the 
"infinite  source."  Therefore,  the  probability  that  there  are  at  Stage 

1 is  set  equal  to  the  probability  that  N-n^  are  in  the  rest  of  the  system, 
i.e.,  the  sum  of  the  probabilities  of  all  the  possible  ways  N-n^  customers 
can  be  partitioned  into  K-l  stages. 


Letting  be  the  approximate  model's  steady-state  probability 

that  there  are  n^  at  Stage  1,  we  have 


where 


w 


K 

{n  , i-2,3,...,K  : £ n = N-n  } , 

1=2  1 


(17) 


8iven  by  Equation  (16),  and  Equation  (15)  has  already  been  sub- 
stituted for  the  joint  probability  function. 


It  is  well  known  that  for  M/M/c  queues, 

W * W ' (18) 

By  the  definition  of  availability.  Equation  (2),  we  can  now  write  (with 
the  subscript  A added  to  denote  the  approximate  model) 


N 

availa  = l q (n  ) , 

n =c,+l 


N 

l 

c,  +1 


w 


(19) 
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Equation  (19)  is  the  availability  of  the  system.  It  requires  Equations 
(17)  and  (16)  to  compute,  but  the  availability  can  be  expressed  totally 
in  terms  of  the  system  parameters  Pj  , c{  (i=l,...,K)  , N , and  K . 


Note  that  Equation  (19)  requires  only  the  evaluation  of  P (n  ) 

A J. 

for  n^  > c^  . This  is  a key  point  in  discussing  the  computational  effi- 
ciency of  the  exact  and  approximate  models. 

4.3  Summary  of  Algebra  and  Some  Inequalities 


Exact  model:  The  exact,  marginal,  general  time  probability  that 

there  are  n^  customers  at  the  first  stage  is 


where 


(10) 


(7) 


K 

S * {n  , i«2,...,K  : I n ” N-n  } , 

i*2  1 1 


K 

S - (n  , i-l,2,...,K  : £ n = N}  , 

i-1 


V 


V Ci 


ni'Ci 


n < r 

i - i 


n > c 

ni  i 


The  exact,  failure  point  probability  that  there  are  n^  customers  at 
the  first  stage  given  that  a failure  is  about  to  occur  is 


w 


fA3  Vl  W ; nl±cl 


A3  Vl  ^^l)  5 nl  " C1 


(11) 


- 22  - 


T-  346 


where 


.-1 


3 = * n Vl  PB(nl)  + * ^ C1U1  PE(nl) 


(12) 


=0 


nrci+1 


The  probability  that  there  are  spares  available  given  that  a failure  is 
about  to  occur  is 


N N 

AVA1Le  - l Q(n^ ) = A3  CjPj  l PE(nL)  . 


(13) 


nrci+1 


nrci+1 


Approximate  model:  The  approximate,  marginal,  general  time  probabil- 

itv  that  there  are  n^  customers  at  the  first  stage  is 


W - *2  l 


K , 

f c p 

y , 

" 1 
i“2 

' 1M1 

1 1 

^ yi  - 

) bi 

(17) 


where 


-1  = n j i J_ 

' i-2  n-oWi/  bi 


(16) 


and  , b^  as  above  in  the  exact  model. 


The  approximate  failure  point  probability  that  there  are  n^  cus- 
tomers at  the  first  stage  given  that  a failure  is  about  to  occur  is 


VV  ' W 


(18) 


The  probability  that  there  are  spares  available  given  that  a failure  is 
about  to  occur  is 


N 


AVAIL. 


I vv  * 

Vci+1 

Inequalities:  With  the  above  algebra  summarized,  it  can  be  shown 


(19) 


(see  Appendix)  that  for  > c^  , 
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W ± PE(ni)  - W • 


Therefore, 


^ W - J VV  i J W • 

n -c  +1 


nl“Cl+1 


1 1 


W 


AVAIL..  > AVAIL,,,.  > AVAIL,  , 
E uhi  A 


where 


AVAIL 


GE 


i W ■ 


"i'ci+1 


(20) 


(21) 


and  where  the  subscript  GE  represents  an  availability  determined  by  using 
the  general  time  probability  (versus  a failure  point  probability)  from  the 
exact  model. 

Inequality  (20)  is  a key  result.  It  shows  that  for  a given  set  of 
system  parameters  (N,  c^,  p^,  i=l,...,K)  , the  true  (exact)  availability 

of  the  system  is  greater  than  that  predicted  by  the  approximate  model  or  by 
using  the  general  time  probability  (versus  the  failure  point)  of  the  exact 
model . 


In  other  words,  the  approximate  and  GE  availabilities  are  conserva- 
tive, i.e.,  if  one  uses  the  approximate  or  GE  approach  to  determine  system 
parameters  to  yield  a certain  availability,  then  the  true  system  avail- 
ability will  actually  be  greater  than  the  original  target. 


4.4  Relationship  Between  the  Two  Models 

The  exact  model  development  involves  the  concept  of  a fixed  flow 
rate  T throughout  the  cycle.  Consider  the  flow  coming  out  of  the  first 
stage.  If  there  is  only  one  customer  in  service,  the  output  rate  from  the 
stage  is  equal  to  the  service  rate,  p^  , for  one  channel.  If  two  are  in 

service,  the  output  rate  is  2p^  . The  output  rate  increases  linearly 

with  the  number  of  customers  in  service,  up  to  the  maximum  service  rate  of 
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• These  different  output  rates,  weighted  by  the  probability  of  each 
rate,  are  what  make  up  T . 

In  the  exact  model, 

C1  N 

' ' Ao  "lUl  Pe<',|>  * J „ Vl  W • 

n^“t)  n^Cj+l 

or,  after  some  algebra, 

C1 

r * Cigi  " l o (crni)pE(ni)  * (22) 

The  second  term  is  negligible  under  a high  availability  constraint 
(i.e.,  there  is  probably  a queue  at  Stage  1,  which  implies  n ^ is  probably 

greater  than  c , which  implies  P (n  ) small  for  n.  < c.  ).  Therefore, 
I LI  I 1 

under  a high  availability  constraint, 

r « c1ul  , (23) 

which  is  a pure  Poisson  output  with  parameter  CjV^  . This  is  what  the 

approximate  model  uses  as  output  from  its  first  stage  ("infinite  source") 
and  input  to  its  second  stage. 

In  summary,  the  exact  model  uses  T from  Equation  (22)  and  the 
approximate  model  uses  T from  Equation  (23). 

5.  Computations 

5.1  Data 

The  following  data  were  obtained  from  [5].  The  application  was  to 
a fleet  of  gas  turbine  engine  ships,  where  the  decision  variables  were  the 
number  of  spare  engine  parts  to  supply  and  the  number  of  repair  channels 
to  provide  in  order  to  minimize  costs  and  to  satisfy  a high  (.90)  avail- 
ability constraint.  Each  engine  had  two  components,  a gas  generator  and  a 
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power  turbine,  requiring  separate  repair  facilities;  however,  only  one 
component  will  bo  analyzed  here.  There  are  four  stages:  operating, 

removal,  transportation,  and  repair.  Two  cases  are  examined,  one  where  it 
is  desired  to  have  10  operating  engines,  and  the  other  where  it  is  desired 
to  have  28  operating  engines.  The  removal  and  transportation  stages  have 
ample  servers,  i.e.,  no  queues  ever  form  in  front  of  Stages  2 and  3.  The 
parameters  for  the  gas  generator  component  (the  more  critical  of  the  two) 
are  given  below: 

* failure  rate  “ .00147186  failures/day 

« removal  rate  = .5  removals/day 

“ transportation  rate  = .1  transports/day 

p^  “ repair  rate  = .01887  repairs/day 


or,  in  terms  of  the  inverses, 

1/VJj  * 679.4  days  between  failures 

l/p^  “ 2 days  to  remove  a machine 

1/p^  » 10  days  to  transport  a machine 

1/p^  = 53  days  to  repair  a machine 

Cj  ■ 10  ; 28  = number  of  operating  machines 

c2  “ c3  * " 00 " “ ample  servers  for  removal  and 
transportation. 


Decision  variables: 


c.  “ number  of  repair  channels  at  the  repair  facility 
4 

N * total  number  of  customers  in  the  system  = number 
of  spares  + c^  . 


Reference  [5]  defined  an  objective  function  using  cost  data  to  arrive 
at  optimal  values  for  the  decision  variables  (number  of  spares,  number  of 
repair  channels).  In  this  paper  no  objective  function  will  be  formed. 

Only  the  effect  of  varying  the  decision  variables  on  the  availability  for 
the  exact  and  approximate  models  will  be  analyzed  for  the  purpose  of  de- 
termining the  accuracy  of  the  approximate  model. 
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5.2  Discussion  of  Results 

Figure  6 shows  the  results  for  the  case  with  10  operating  engines 

(c,  * 10)  . The  abscissa  is  the  number  of  repair  channels,  c.  , at  the 
I 4 

fourth  stage  (Stages  2 and  3 have  ample,  or  infinite,  servers),  and  the 

ordinate  is  the  availability  of  spares. 

The  curves  plotted  represent  the  availability  as  a function  of  c, 

4 

for  a fixed  number  of  spares.  The  availability  is  computed  three  ways: 

by  the  exact  model,  AVAIL  , Equation  (13);  by  the  approximate  model,  AVAIL., 

“ A 

F.quation  (19);  and  b'f  the  general  time  probability  of  the  exact  model, 

AVAIL  Equation  (21).  Note  how  the  computed  availabilities  are  consistent 

with  Inequality  (20),  which  shows  the  GE  and  approximate  models  to  be 
conservat ive. 

Just  to  the  right  of  the  model  designation  (AVAIL  , etc.)  is  the 

E 

computer  time  and  percentage  error.  The  time  is  the  average  number  of 
"system  seconds"  (a  combination  of  compilation,  execution,  input /output , 
etc.)  of  computer  time  used  in  the  calculation  of  any  given  point  on  that 
curve.  The  percentage  error  is  relative  to  the  exact  model  and  is  computed 
after  the  apparent  asymptote  has  been  reached.  Percentage  errors  are  also 
shown  at  two  other  points  on  each  curve,  before  the  asymptote  is  reached. 

For  example,  for  the  one  spare  case,  the  approximate  model  took 
about  3. 1 system  seconds  to  compute  eacli  point  on  the  curve  (which  is  com- 
posed of  about  five  points),  and  the  error  was  about  4%  for  points  with 
more  than  three  servers.  This  error  increased  to  6%  for  two  servers,  and 
to  36Z  for  one  server.  Note  that  the  number  of  servers  is  with  respect  to 
the  fourth  stage  of  this  cyclic  queueing  system. 

The  results  indicate  that  the  availabilities  quickly  approach  hori- 
zontal asymptotes  as  the  number  of  servers  increases.  In  addition,  the 
asymptotes  for  E,  GE  and  A more  quickly  approach  each  other  as  the  number 
of  spares  Increases,  i.e.,  the  percentage  error  from  not  using  the  exact 
model  decreases. 
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A key  observation  is  that  if  the  availability  were  constrained  to 
be  "high,"  say  .9,  then  the  objective  function  for  a particular  application 
would  only  be  evaluated  over  a region  where  the  number  of  servers  is  greater 
than  two  and  the  number  of  spares  is  greater  than  three.  In  this  region, 
the  approximate  and  exact  models,  as  well  as  the  exact  general  time  probabil- 
ity approach,  yield  almost  the  same  results. 

In  Figure  7 we  have  similar  trends  for  the  case  where  c^  = 28  (i.e., 

I l 

28  machines  are  to  be  operating).  Here  an  availability  constraint  of  .9 
would  force  the  objective  function  to  a range  in  which  there  are  at  least 
four  servers  and  six  spares.  Again,  the  approximate  model  and  the  exact 
general  time  approach  yield  very  good  results  with  respect  to  the  exact  model. 
However,  note  how  the  computer  times  vary.  The  approximate  model  takes 
only  about  one-sixth  of  the  time  for  almost  the  same  accuracy. 

This  time  difference  can  be  explained  by  looking  at  the  algebraic 
summary.  Section  4.3.  It  takes  more  time  to  compute  the  constant  of  inte- 
gration, A^  , Equation  (7),  for  the  exact  model  (versus  Aj  , Equation 

(16),  of  the  approximate  model),  but  this  is  but  a small  part  of  the  dif- 
ference. Equation  (13)  for  the  exact  model  requires  the  computation  of 

a constant  A , Equation  (12),  which  involves  P (n  ) for  all  n from 
J Ell 

0 to  N,  the  maximum  in  the  system.  On  the  other  hand,  the  approximate 

model  only  involves  P (n  ) for  n from  c +1  to  N in  Equation  (19). 

A i 1 1 

Considering  that  for  each  value  of  n^  both  the  exact  and  the  approximate 
models  must  go  through  the  partitioning  of  N-n^  customers  into  K-l  sub- 
sets (i.e.,  the  set  in  Equations  (10)  and  (17)),  the  approximate  model 

becomes  increasingly  easier  than  the  exact  model  to  evaluate  as  N increases 
(where  N ■ c^  + number  of  spares).  Thus,  for  Cj=*l0  (and  N ranging  from 

11  to  15)  the  time  differences  were  relatively  great,  but  in  magnitude  not 
startling;  whereas  for  c^“28  (and  N ranging  from  30  to  34),  the  computa- 
tion time  differences  were  significant,  both  relatively  and  absolutely. 
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When  comparing  the  approximate  model  with  the  general  time  prob- 
ability of  the  exact  model  in  Equation  (21)  (the  GE  model),  the  approximate 
model  is  faster  than  the  GE  model  due  to  the  time  involved  in  computing  the 
constants  and  , Equations  (16)  and  (7),  respectively.  However,  this 

general  time  probability  from  the  exact  model  approach  still  yields  substan- 
tial time  savings  over  the  exact  model  because  the  constant  A^  , Equation 

(12),  is  not  Involved  in  the  GE  model. 

Some  observations  on  the  asymptotic  behavior  of  availability  are 
worth  noting.  Figures  6 and  7 clearly  indicate  that  availability  can  be 
increased  only  to  a certain  value  as  the  number  of  servers  at  Stage  4 is 
increased  (for  a constant  number  of  spares).  Obviously,  when  the  number 
of  servers  exceeds  the  total  population  in  the  system,  the  availability 
can  no  longer  be  affected.  But  the  asymptote  is  reached  long  before  the 
number  of  servers  becomes  "ample"  at  Stage  4. 

There  appears  to  be  a similar  asymptote  for  availability  given  a 
fixed  number  of  servers  and  an  increasing  number  of  spares  (i.e.,  a vertical 
cut  in  Figures  6 and  7 to  derive  a cross  plot).  If  contours  of  constant 
availability  in  server-spare  space  are  plotted,  a picture  such  as  Figure  8 
results. 

One  final  note  on  Figure  7:  for  less  than  three  servers  at  Stage  4, 
the  approximate  model  cannot  be  used  because  the  condition  c^  < c^  for 

i“2,...,K  is  not  satisfied.  Thus,  the  approximate  model  has  a limit  on  its 
applicability.  However,  the  approximate  model  was  formulated  with  a high 
availability  constraint  at  Stage  1 in  mind.  For  c^  > c^  there  is 

great  congestion  at  Stage  4 rather  than  at  Stage  1,  and  the  high  availability 
constraint  at  Stage  1 is  not  satisfied. 

6 . Conclusions 

This  paper  has  formulated  and  compares  an  exact  and  approximate 
model  to  handle  an  extended  machine  repair  with  spares  problem.  Under  a 
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"high"  availability  constraint,  the  approximate  model  was  shown  to  be  compu- 
tationally more  efficient  and  almost  exact  in  its  results.  Furthermore, 
the  approximate  model  was  shown  to  be  conservative,  i.e.,  it  always  under- 
estimated the  true  availability  of  the  system. 

A third  approach  to  computing  availability  was  investigated.  This 
involved  using  the  general  time  probabilities  from  the  exact  model  (versus 
the  exact  failure  point  probabilities).  The  results  were  between  the  exact 
and  approximate  models  in  computation  time  and  precision. 

The  main  conclusion,  then,  is  to  use  the  approximate  model  if  a 
"high"  availability  constraint  is  present,  and  the  exact  model  otherwise. 

Ir  fact,  one  may  be  forced  to  use  the  exact  model  for  situations  where 
c^Uj  is  not  strictly  less  than  c^y^  for  1=2,..., K (which  violates  the 

basic  assumption  of  an  M/M/c  queue).  In  those  low  availability  cases  where 
N is  "large,"  the  general  time  (versus  failure  point)  probabilities  for  the 
exact  model  can  be  used  to  save  computer  time. 
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APPENDIX  A 

Proof  That  Qg^)  > p£(n1)  > Q^n^ 


for  tij  > Cj 


This  appendix  proves  algebraically  that  the  failure  point  prob- 
ability in  the  exact  model  is  greater  than  or  equal  to  the  generaL  time 
probability  in  the  exact  model,  which  is  greater  than  or  equal  to  the  failure 
point  probability  in  the  approximate  model,  for  n^  customers  at  Stage  1, 

when  n^  is  greater  than  c^  , the  number  of  servers  at  Stage  1. 


Since  QA(n^)  “ PA^nl^  » the  approximate  model  failure  point 

and  general  time  probabilities  are  equal),  then  we  want  to  show 


Qg^)  il  PE*nl^  - PA^ni ) » for  ni  > ci  * (Al) 

where,  repeating  relationships  and  notation  here  for  convenience, 

QE (n 1 > * failure  point  probability  in  the 

exact  model  that  there  are  exactly 
n^  customers  at  Stage  1 

= A;jc1p1  Pg(nj)  ; n3  > cL  . (11) 

(ni ) * general  time  probability  in  the  exact 
model  that  there  are  exactly  n^ 

customers  at  Stage  1 


P^n^  ) " general  time  probability  in  the  ap- 
proximate model  that  there  are  exactly 
n^  customers  at  Stage  1 
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t 


**1  LL(?) 


i_ 

b, 


(17) 


t-i'V  hi 


(7) 


! ? (ciPi 

)\ 

jn^oV  *i 

1 bi 

(16) 


-1 


1 N 

* „ Vl  PE(ni)  + ^ C1U1  PE("l) 


n =0 


nl“Cl+l 


(12) 


c = number  of  parallel  servers  at  Stage  1, 

1 1-1,2 K 

U = service  rate  of  each  server  at  Stage  1, 

1 1-1,2,.. .,K 

- number  of  customers  at  Stage  1,  1-1, 2,..., K 
K = total  number  of  stages 

N - total  number  of  customers  in  the  K stages 

K 

l -i 

1-1 


V 


n < c 
i - i 


, "i  C1 

lCi!  C1  ; ni>Ci 


(A2) 


(n  ; 1-2, 3,..., K : £ n - N-n  } 

1 1-2  1 1 


(9) 


(n  ; 1-1,2,. ,.,K  : £ n - N) 

1 1-1  1 


(8) 


With  the  notation  established,  consider  first  the  inequality 
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W - PF.(nl)  ; ni  > C1 


Substituting  (11)  and  (12), 


Vl  W 


; VV  • 


J nlUl  W+  I c,Ul  PE(„,) 

1 1 1 1 

Under  the  reasonable  assumption  that  Pp(nj)  ^ 0 for  n =0,1,. 
above  reduces  to 


? C1 


c,  > 5”  n,P  fn,)  + Y c P (n  ) 

1 ' n>0  1 E 1 n =c  +1  1 E 1 

l 1 1 


Since  y P (n. ) = 1 , c.  can  be  written  as 
n^O  E 1 1 


'■  ■ w + j w » |0  ♦ „ j 

1 i ii 

wlilch  reduces  to 


? C1 


„L  c^<n>)  - „L„  v.<-i>  • 


which  is  true  since  ^ in  the  right-hand  summation  Is  always  less  than 
equal  to  c^  . Therefore, 


QE(nl>  > pE(n|)  5 ni  ? ci 

- - 


Now  it  remains  to  prove  the  second  part  of  Inequality  (Al): 

W i W « "!»«!• 

Substituting  (10)  and  (17), 
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K 

fu. 

! n 1 

l 

1 i»l  ' 

{"U 

' hi 

In  order  to  get  some  cancellations,  we  change  tht^  index  in  the  product  term 
on  the  left-hand  side  and  remove  c^  from  the  product  term  (noting  that 
K 

£ n “ N-n  ) on  the  right-hand  side. 
i-2  1 1 

(vn4  \ n,  \ 

U,  \ , / ? N-n,  K /u,  \ , / 

"J  Sj  - *2  S2  i-2  W/  ^j' 

On  the  left-hand  side,  using  (A2), 


VC1 

bJ  = c]  • c!  ; *>!  > Cj  . 


Under  S2  from  (9),  n^  is  a constant,  and  we  can  move  terms  outside 
the  summations  to  get 

n,  \ / n , 

Ai  K /u,  \ i / ? N-n,  K /u  \ 1 , I 

' , /vct  L i-2  w n I - c>  L 1-2  (0  ■ 


C ! C 

1 ci 


Now  substitute  (7)  and  (16): 

1 


s^r  i a2  • 

V '1 


N-c.  ( K /u  \ 1 K » /c.u,\  , 

c,  1 l \ n (-M  U n l 1- 

1 s'  ) i- 1 \ Ui  / bi(  i-2  n -0  \ ui  / bi 


N nj( 

Noting  Cj  “ cj  • c2  • • Cj  , and  by  changing  the  index  on  the  product 

term  on  the  left-hand  side,  and  by  inverting  both  sides  and  changing  the 
direction  of  the  inequality, 
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K 

n f 


S,_  l i-2 


< 

it  - 


N N N 

l l I 

v°  vr°  v° 


K 

" fi 

i-2  1 


which  simply  bounds  the  summation  over  the  restricted  set  to  more 


manageable  summations  to  N . Therefore,  the  final  step  is  to  show 

.1  "I  ‘ 

Substituting  the  value  of  f , the  inequality  to  prove  now  is 


N N N 

l •••  l l 


K 

n f 


V° 


n^-0  n^-O  t i-2 


? K 

< n 


i-2  l n t=0 


? ? (cjh)2  (cjh\K  * 

Jo  Jo  V / b2  \VJ  VW/  bK 


I 


It  is  clear  that  the  product  of  infinite  sums  on  the  right-hand  side  will 
generate  all  the  terms  generated  by  the  finite  sums  of  the  p>  -ducts  on  the 
left-hand  side  plus  an  infinite  number  more  of  miscel 1 anei  us  toss  products. 
Since  all  terms  are  positive,  the  inequality  is  Satisfied.  I ..  refore. 


W i W 

"l  > C1 

I 

This  completes  the  proof. 


V"l>  ± W i VV  • "l  > C1 


(Al) 
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